Semantic Understanding and Commonsense Reasoning

نویسنده

  • Xinyu Hugo Liu
چکیده

In a story telling authoring task, an author often wants to set up meaningful connections between different media, such as between a text and photographs. To facilitate this task, it is helpful to have a software agent dynamically adapt the presentation of a media database to the user's authoring activities, and look for opportunities for annotation and retrieval. Expecting the user to manually annotate photos with keywords greatly burdens the user. Furthermore, even when photos are properly annotated, their retrieval is often very brittle because semantic connections between annotations and the story text that are "obvious" to people (e.g. between "bride" and "wedding") may easily be missed by the computer. ARIA (Annotation and Retrieval Integration Agent) is a software agent that acts as an assistant to a user writing e-mail or Web pages. As the user types a story, it does continuous retrieval and ranking on a photo database. It can use descriptions in the story text to semi-automatically annotate pictures based on how they are used. The focus of this thesis is threefold: Improving ARIA's automated annotation capabilities through world-aware semantic understanding of the text; making photo retrieval more robust by using a commonsense knowledge base, Open Mind Commonsense, to make semantic connections between the story text and annotations (e.g. connect "bride" and "wedding"); and learning personal commonsense through the text (e.g. "My sister's name is Mary.") that can then be used to improve photo retrieval by enabling personalized semantic connections. Thesis Supervisor: Dr. Henry Lieberman Title: Research Scientist, MIT Media Laboratory

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Visual common-sense for scene understanding using perception, semantic parsing and reasoning

In this paper we explore the use of visual commonsense knowledge and other kinds of knowledge (such as domain knowledge, background knowledge, linguistic knowledge) for scene understanding. In particular, we combine visual processing with techniques from natural language understanding (especially semantic parsing), common-sense reasoning and knowledge representation and reasoning to improve vis...

متن کامل

Reasoning with Heterogeneous Knowledge for Commonsense Machine Comprehension

Reasoning with commonsense knowledge is critical for natural language understanding. Traditional methods for commonsense machine comprehension mostly only focus on one specific kind of knowledge, neglecting the fact that commonsense reasoning requires simultaneously considering different kinds of commonsense knowledge. In this paper, we propose a multi-knowledge reasoning method, which can expl...

متن کامل

Understanding Stories with Large-Scale Common Sense

Story understanding systems need to be able to perform commonsense reasoning, specifically regarding characters’ goals and their associated actions. Some efforts have been made to form large-scale commonsense knowledge bases, but integrating that knowledge into story understanding systems remains a challenge. We have implemented the Aspire system, an application of large-scale commonsense knowl...

متن کامل

Towards Understanding Natural Language: Semantic Parsing, Commonsense Knowledge Acquisition and Applications

There are various aspects of making computers understand natural language. Semantic parsing and reasoning on commonsense knowledge are the two important ones. Many NLU tasks such as question answering and co-reference resolution require semantic parsing of text and reasoning with different kinds of commonsense knowledge. In this work we present our progress towards these milestones of NLU. We d...

متن کامل

Ethnomethodology and Conversational Analysis

In a speech community, people utilize their communicative competence which they have acquired from their society as part of their distinctive sociolinguistic identity. They negotiate and share meanings, because they have commonsense knowledge about the world, and have universal practical reasoning. Their commonsense knowledge is embodied in their language. Thus, not only does social life depend...

متن کامل

Semantic Understanding and Commonsense Reasoning in an Adaptive Photo Agent

In a story telling authoring task, an author often wants to set up meaningful connections between different media, such as between a text and photographs. To facilitate this task, it is helpful to have a software agent dynamically adapt the presentation of a media database to the user's authoring activities, and look for opportunities for annotation and retrieval. Expecting the user to manually...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014